151 research outputs found

    Classifying DNA repair genes by kernel-based support vector machines

    Get PDF
    Human longevity is a complex phenotype that has a significant genetic predisposition. Like other biological processes, ageing process is governed through the regulation of signaling pathways and transcription factors. The DNA damage theory of ageing suggests that ageing is a consequence of un-repaired DNA damage accumulation. Intensive research has been carried out to elucidate the role of DNA repair systems in the ageing process. Decision Trees and Naive Bayesian Algorithm are two data-mining based classification methods for systematically analyzing data about human DNA repair genes. In this paper we develop a linearly combined kernel with Support Vector Machine (SVM) to analyze the ageing related data. The popular supervised learning algorithm enables better discrimination between ageing-related and non-ageing-related DNA repair genes. The linear combination of linear kernel and polynomial kernel of degree 3 in conjunction with SVM allows better classification accuracy in DNA repair gene data set. Compared to Decision Trees and Naive Bayesian Algorithm, SVM with the proposed kernel can achieve 65% AUC (Area Under ROC Curve) values, in contrast to 51.1% and 52.1% respectively. More importantly, we obtain 5 significant ageingrelated genes selected through the training on the whole data set and they are PCNA, PARP, APEX1, MLH1 and XRCC6. Different from the two methods, we can identify another important gene PCNA in the pathways the two methods targeted, while they failed to. And two novel genes PARP, MLH1 are selected as well. The two genes might provide potential insights for biologists in ageing research. SVM is a powerful and robust classification algorithm that can yield higher predictive accuracies. The selection of proper kernel plays a more important role in fulfilling the classification task. The important genes identified not only can target critical pathways related to ageing but also detected genes that may reveal possible related ageing biomarkers

    On Modeling Economic Default Time: A Reduced-Form Model Approach

    Full text link
    In the aftermath of the global financial crisis, much attention has been paid to investigating the appropriateness of the current practice of default risk modeling in banking, finance and insurance industries. A recent empirical study by Guo et al.(2008) shows that the time difference between the economic and recorded default dates has a significant impact on recovery rate estimates. Guo et al.(2011) develop a theoretical structural firm asset value model for a firm default process that embeds the distinction of these two default times. To be more consistent with the practice, in this paper, we assume the market participants cannot observe the firm asset value directly and developed a reduced-form model to characterize the economic and recorded default times. We derive the probability distribution of these two default times. The numerical study on the difference between these two shows that our proposed model can both capture the features and fit the empirical data.Comment: arXiv admin note: text overlap with arXiv:1012.0843 by other author

    On Reduced Form Intensity-based Model with Trigger Events

    Full text link
    Corporate defaults may be triggered by some major market news or events such as financial crises or collapses of major banks or financial institutions. With a view to develop a more realistic model for credit risk analysis, we introduce a new type of reduced-form intensity-based model that can incorporate the impacts of both observable "trigger" events and economic environment on corporate defaults. The key idea of the model is to augment a Cox process with trigger events. Both single-default and multiple-default cases are considered in this paper. In the former case, a simple expression for the distribution of the default time is obtained. Applications of the proposed model to price defaultable bonds and multi-name Credit Default Swaps (CDSs) are provided

    On Pricing Basket Credit Default Swaps

    Get PDF
    In this paper we propose a simple and efficient method to compute the ordered default time distributions in both the homogeneous case and the two-group heterogeneous case under the interacting intensity default contagion model. We give the analytical expressions for the ordered default time distributions with recursive formulas for the coefficients, which makes the calculation fast and efficient in finding rates of basket CDSs. In the homogeneous case, we explore the ordered default time in limiting case and further include the exponential decay and the multistate stochastic intensity process. The numerical study indicates that, in the valuation of the swap rates and their sensitivities with respect to underlying parameters, our proposed model outperforms the Monte Carlo method
    • …
    corecore